Predicting Hit Songs with MIDI Musical Features

نویسنده

  • Keven Wang
چکیده

This paper predicts hit songs based on musical features from MIDI files. The task is modeled as a binary classification problem optimizing for precision, with Billboard ranking as labels. Million Song Dataset (MSD) is inspected audibly, visually, and with a logistic regression model. MSD features is determined too noisy for the task. MIDI files encodes pitch duration as separate instrument tracks, and is chosen over MSD. Fine-grained instrument, melody, and beats features are extracted. Language models of n-grams are used to transform raw musical features into word-document frequency matrices. Logistic Regression is chosen as the classifier, with increased probability cutoff to optimize for precision. An ensemble method that uses both instruments/ melody as well as beats features produces the peak precision 0.882 at probability cutoff 0.998 (recall is 0.279). Alternative models and applications are discussed.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Musical features of intermediate complexity for music recognition

To achieve my goal, I developed a theory of musical features inspired by Ullman's Goldilocks theory for visual features. I implemented a computer program that analyzes instrumental (MIDI) versions of the songs to extract relevant parameters for my model. This program also identifies music pieces based on a melody contour of varying length and complexity. Finally I experimentally verified the hy...

متن کامل

Visualization of music collections based on structural similarity

Users interact a lot with their personal music collections, typically using standard text-based interfaces that offer constrained functionalities based on assigned metadata or tags. Alternative visual interfaces have been developed, both to display graphical views of music collections that attempt to reflect some chosen property or organization, or to display abstract visual representations of ...

متن کامل

Mixing Music as Linked Data: SPARQL-based MIDI Mashups

A large number of datasets about music are available today in the Linked Open Data cloud, but most of them only describe music metadata. Datasets representing music notation (i.e. fine-grained musical transcriptions) are scarce, and hence musicians do not have the possibility to exploit Web technologies to their full potential. In particular, this situation hampers the musician’s process of cre...

متن کامل

SPARQL-DJ: The MIDI Linked Data Mashup Mixer for Your Next Semantic Party

Many datasets describing musical resources are published today as Linked Data, mainly focusing on metadata, notation, and audio features. However, the availability of musical data as Linked Data is often not enough for musicians, who need additional layers of software and queries to accomplish their workflows. Mashups are compositions created by blending two or more pre-recorded songs, and it h...

متن کامل

Automatic Transcription of Music

A system for the automatic transcription of music is described. Signal processing methods are introduced that solve different facets of the overall problem. Main emphasis is laid on finding the multiple pitches of concurrent musical sounds. Sound onset detection and musical meter estimation are described to some extent. Other topics discussed are noise robustness, estimation of the number of co...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014